A Support Vector Method for Clustering

نویسندگان

  • Asa Ben-Hur
  • David Horn
  • Hava T. Siegelmann
  • Vladimir Vapnik
چکیده

We present a novel method for clustering using the support vector machine approach. Data points are mapped to a high dimensional feature space, where support vectors are used to define a sphere enclosing them. The boundary of the sphere forms in data space a set of closed contours containing the data. Data points enclosed by each contour are defined as a cluster. As the width parameter of the Gaussian kernel is decreased, these contours fit the data more tightly and splitting of contours occurs. The algorithm works by separating clusters according to valleys in the underlying probability distribution, and thus clusters can take on arbitrary geometrical shapes. As in other SV algorithms, outliers can be dealt with by introducing a soft margin constant leading to smoother cluster boundaries. The structure of the data is explored by varying the two parameters. We investigate the dependence of our method on these parameters and apply it to several data sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diagnosis of Heart Disease Based on Meta Heuristic Algorithms and Clustering Methods

Data analysis in cardiovascular diseases is difficult due to large massive of information. All of features are not impressive in the final results. So it is very important to identify more effective features. In this study, the method of feature selection with binary cuckoo optimization algorithm is implemented to reduce property. According to the results, the most appropriate classification fo...

متن کامل

Stock Price Prediction using Machine Learning and Swarm Intelligence

Background and Objectives: Stock price prediction has become one of the interesting and also challenging topics for researchers in the past few years. Due to the non-linear nature of the time-series data of the stock prices, mathematical modeling approaches usually fail to yield acceptable results. Therefore, machine learning methods can be a promising solution to this problem. Methods: In this...

متن کامل

Prediction of Fe-Co-Mn/MgO Catalytic Activity in Fischer-Tropsch Synthesis Using Nu-support Vector Regression

Support vector regression (SVR) is a learning method based on the support vector machine (SVM) that can be used for curve fitting and function estimation. In this paper, the ability of the nu-SVR to predict the catalytic activity of the Fischer-Tropsch (FT) reaction is evaluated and the result is compared with two other prediction techniques including: multilayer perceptron (MLP) and subtractiv...

متن کامل

Prediction of Electrofacies Based on Flow Units Using NMR Data and SVM Method: a Case Study in Cheshmeh Khush Field, Southern Iran

The classification of well-log responses into separate flow units for generating local permeability models is often used to predict the spatial distribution of permeability in heterogeneous reservoirs. The present research can be divided into two parts; first, the nuclear magnetic resonance (NMR) log parameters are employed for developing a relationship between relaxation time and reservoir poro...

متن کامل

Magnetic Calibration of Three-Axis Strapdown Magnetometers for Applications in Mems Attitude-Heading Reference Systems

In a strapdown magnetic compass, heading angle is estimated using the Earth's magnetic field measured by Three-Axis Magnetometers (TAM). However, due to several inevitable errors in the magnetic system, such as sensitivity errors, non-orthogonal and misalignment errors, hard iron and soft iron errors, measurement noises and local magnetic fields, there are large error between the magnetometers'...

متن کامل

A review of Support Vector Clustering with different Kernel function for Reduction of noise and outlier for Large Database

For a long decade clustering faced a problem of noise and outliers. Support Vector Clustering is one of the techniques in pattern recognition. Support Vector Clustering is Kernel-Based Clustering. Division of patterns, data items, and feature vectors into groups (clusters) is a complicated task since clustering does not assume any prior knowledge, which are the clusters to be searched for. Nois...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000